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Introduction 


Much  of  the  real  time  signal  processing  in  radar  applications  has  to  be  done  in 
analog  in  order  to  achieve  the  required  high  speed.  However,  the  accuracy  in  an  analog 
circuit  is  directly  related  to  its  limited  dynamic  range.  Some  signal  processing 
applications  (radar  mainlobe  and  sidelobe  nulling)  require  accuracies  that  cannot  be 
achieved  via  analog  processing.  Digital  processing  accuracies,  however,  are  limited  by 
the  number  of  bits  used  in  the  calculations.  The  accuracy  of  a  digital  processor  is  not 
limited  to  that  of  its  internal  word  length.  Through  the  use  of  appropriate  algorithms,  an 
unlimited  number  of  bits  can  be  used  in  the  .alculations  it  performs,  thus  yielding 
unlimited  precision  (with  a  tradeoff  in  s|>ecd).  The  inherent  parallcii.sm  of  optics  can  be 
exploited  to  reali/.e  high  speed  parallel  processors.  This  concept  was  propo.sed  by 
OptiComp  Corporation  to  the  Office  of  Naval  Research,  contract  N00014-87-C-(X)77^ 
This  architecture  is  based  on  the  ability  to  generate  Boolean  Sum  of  Minterms  or 
Sum  of  Products  (SOP)  in  a  Systolic  manner.  The  generation  of  the  SOP  can  be  done 
by  imaging  a  Spatial  Light  Modulator  (SLM)  onto  a  second  SLM  to  perform  the  logical 
AND  operation  and  using  a  cylindrical  lens  to  sum  each  column  onto  a  linear  detector. 
Applying  a  proper  threshold  to  the  output  of  the  detector  yields  a  logical  OR  operation. 
Figure  1-1  shows  rn  word ;  applied  to  two  Sl.MS  with  height  ni  and  width  n.  Fach  word 
consists  of  n  bits,  and  thus  the  detector  is  n  elements  long.  The  output  at  the  detector  C- 
yields: 

n  n  n  n  n 

By  rotating  the  cylindrical  lens  and  the  detector  90  degrees,  and  using  a  m  element 
detector  yields: 


So,  by  rotating  a  lens,  a  different  set  of  operations  tU'c  performed.  However,  in  this  case, 
the  logical  ORing  across  the  bits  of  the  same  word  is  not  vei7  useful,  and  would  not  be 
a  practical  configuration  of  this  architecture.  Note  that  for  every  data  word  loaded,  an 
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AND  followed  by  an  OR  operation  is  periormed.  If  one  wanted  to  logically  OR  two 
words  together,  but  not  do  an  AND  operation,  both  words  could  lie  input  to  SLM  A,  The 
remainder  of  the  SLM  A  would  be  all  zeroes.  SLM  B  would  be  set  to  all  ones.  To  do 
just  a  logical  AND  of  two  words,  one  word  would  be  input  to  SLM  A,  the  other  to  SLM 
B  (.same  row  in  each  SLM),  While  this  works,  the  true  parallelism  of  this  architecture 
is  hardly  exploited  as  only  one  operation  on  two  binary  words  is  carried  out  when  the 
SLMs  are  loaded. 

Another  configuration  which  offers  more  parallelism  is  shown  in  Figure  1-2. 
Here,  each  pair  of  data  words  are  loaded  into  SLM  A,  and  instructions  dictating  what  type 
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Figure  1-2;  Logic  Instructioas  Acting  On  Parallel  Loaded  Data 


of  operation  on  those  words  are  loaded  into  SLM  B.  Thus,  one  could  fill  each  SLM  v'ith 
data  and  instructions  that  could  be  done  simultaneously.  One  drawback  to  this  approach 
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is  that  only  one  logical  operation  is  being  executed-the  logical  OR.  The  second  SLM 
only  selects  which  bits  of  the  two  words  are  to  be  ORed,  To  do  an  AND  operation,  one 
has  to  make  use  of  DeMorgan’s  theorem: 

+^. 

Note  this  requires  using  the  complement  of  the  variables  for  input.  However,  this  type 
of  architecture  has  no  capability  for  directly  complementing  bits.  If  an  AND  operation 
is  to  be  performed,  the  data  would  have  to  be  complemented  before  being  input  to  the 
SLM.  When  an  OR  operation  is  required,  no  complementing  would  be  done. 

Systolic  processing  consists  of  a  number  of  processors  regularly  interconnected. 
These  individual  processors  cairy  out  a  small  numlicr  of  functions,  typically  in  one  clock 
cycle  and  are  referred  to  as  F^roccssing  Elements  (PEs).  The  interconnection  may  be 
linear  (.see  Figure  l-.l),  or  a  2D  mesh' ^  Data  flows  into  a  PH,  and  its  results  along  with 
the  data  flow  into  the  next  PE.  Since  the  data  is  alst>  pas.sed  from  PH  to  PE,  this 
distingui.shes  a  linear  systolic  processor  from  a  pipelined  proces.sor'  ^  Referring  to  F’igure 
l-.l,  note  that  although  the  data  may  enter  and  exit  .sequentially,  when  the  proces.sor  is 
filled  (all  PEs  are  operating  on  some  data  .siuiultaneousiy),  there  exists  pmallelism  which 
can  be  quite  high  if  there  are  a  large  number  of  PEs.  For  example,  if  data  enters  at  a 
clock  rate  of  10  MHz  into  a  linear  systolic  processor  that  lias  10  PEs,  the  equivalent 


PE  -  PE  ■  ►PE 

Figure  1-3:  Linear  Systolic  Array 

processing  power  would  be  that  of  a  100  MHz  sequential  proces.sor  (10  PEs  working  at 
10  MHz  each).  Note  two  ideal  conditions  exist:  the  I/O  data  rate  to/from  the  processor 
is  low,  but  the  overall  priKessing  bandwidth  is  high,  and  each  PE  consi.sts  of  a  low 


bandwidth,  simple  processor.  The  tradeoff  to  this  lype  of  architecture,  however,  is  that 
it  is  not  a  general  purpose  processor.  Its  applications  are  in  signal  processing  such  as 
Finite  Iirpulse  Response  filters,  deconvolution  and  matrix  multiplication.  Systolic 
processors  are  generally  "hard  wired"  to  a  paiticular  algorithm  and  thus  cannot  be 
reconfigured  to  perform  some  other  task. 

2  Processor  Description 

The  Sy.stolic  Processor  propo.sed  for  this  evaluation  consists  of  two  multi-channel 
Acoustic-Optic  (AO)  cells  for  data  input,  and  a  cylindrical  lens  which  does  the  logical  OR 
function  onto  a  linear  photodiode  array  (.sec  Figure  2-1).  Data  is  input  to  the  SLMs  from 
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Figure  2-1:  Acoustic-Optic  Digital  Sy.stolic  Proce.s.sor 


a  central  controller.  The  output  data  from  the  photodiode  array  is  thrcsholded,  buffered 
and  .sent  back  to  the  controller.  The  controller  could  be  a  general  purpose  electronic 
computer  for  reprogrammability,  a  finite  state  machine  for  speed,  or  a  hybrid  of  the  two. 


Figure  2-2  is  a 
diagram  of  one  channel  of 
the  AO  cell.  The 
transducer  is  fed  a  digitally 
modulated  RF  carrier  (RF 
pulse  train).  The  vibrating 
transducer  creates  an 
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acoustic  wave  which  Figure  2-2:  A  Single  AO  Cell  Channel 
traverses  the  ci7stal.  This 

acoustic  wave  changes  the  index  of  refraction  of  the  material  creating  a  phase  grating 
which  diffracts  the  incident  light.  Thus,  this  RF  pulse  train  is  transformed  into  a  moving 
train  of  optical,  digital  data. 

Referring  back  to  Figure  2-1,  AO  cell  A’s  diffracted  optical  data  is  imaged  onto 
AO  cell  B.  Thus,  if  a  logical  1  is  present  on  AO  cell  A,  light  will  be  incident  on  AO  cell 
B.  If,  at  that  point  in  time  AO  cell  B  has  a  logical  1  also,  the  light  will  be  diffracted  and 
imaged  onto  the  linear  photodiode  array.  Data  is  placed  in  AO  cell  A  (word  parallel)  and 
instructions  in  AO  cell  B,  and  they  will  propagate  toward  each  other  at  a  rate  that  is 
dependent  on  the  type  and  cut  of  the 
crystal.  Figure  2-3  is  a  block  diagram  of 
a  top  down  view  of  the  AO  cells  A  and  B, 
the  cylindrical  lens  and  the  photodiode 
array.  Data  (DO  ...  D4)  is  input  .o  SLM 
A,  and  instructions  (10  ...  14)  are  entered 
into  SLM  B.  At  first,  DO  and  10  interact. 

At  the  next  time  interval,  DO  interacts 
with  II  and  D1  interacts  with  10.  Thus, 
each  word  of  the  instructions  .serially 
interacts  with  each  data  word  which  is  a 

form  of  systolic  piocessing,  —  - 

Figure  2-3:  Top  Down  View  of  Figure  2-1 
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Recall  from  the  Introduction,  this  processing  architecture  cannot  perform  negation 
on  its  own.  Thus,  the  data  is  entered  in  dual  rail  format.  Each  bit  and  its  complement 
is  entered  into  the  first  AO  cell  so  that  the  instructions  have  access  to  the  data  words  and 
their  complements  simultaneously. 


3  System  Setup 

To  perform  the  evaluation,  a  subset  of  the  processor  was  fabricated  (see  Figure 
3-1).  The  controller  and  input/output  buffers  were  omitted.  Periodic  waveforms  from  a 
pul.se  generator  were  used  as  data  inputs  to  the  SLMs  to  test  for  alignment  of  the  system. 
The  photodiode  airay  was  placed  on  a  printed  circuit  board  which  was  mounted  on  an 
XYZ  translation  stage  which,  in  turn,  was  mounted  to  a  rotation  stage.  RG188  coaxial 
cable  connected  each  diode  of  the  aitay  to  a  patch  panel  of  BNC  connectors.  This  patch 
panel  allowed  probing  of  each  photodiode,  and  isolated  the  connections  from  the  printed 
circuit  board  so  the  alignment  would  not  be  disturbed.  A  digital  scope  with  an  internal 
50  ohms  resistance  was  connected  to  the  patch  panel  and  the  scope’s  internal  resistance 
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converted  the  photodiode’s  current  into  a  useable  voltage  waveform  and  eliminated  the 
need  for  a  high  speed  transimpedance  amplifier  circuit.  Load  resistances  of  50  ohms 
integrated  with  BNC  connectors  were  placed  on  all  unused  diode  outputs  to  keep  the 
crosstalk  among  the  photodiodes  to  a  minimum. 

A  diagram  of  the  optical  layout  is  shown  in  Figure  3-2.  A  10  mW  HeNe  laser 
was  used  as  the  source  in  the  setup.  The  la.scr  beam  passed  through  collimating  and 
expanding  optics  and  imaged  a  16  channel  Acoustic-Optic  cell  onto  another  and  then 
focused  down  onto  the  photodiode  array.  The  first  diffracted  order  of  the  AO  cell  was 
imaged  onto  the  second  AO  cell.  All  diffracted  orders  from  each  cell,  except  the  first 
needed  to  be  filtered  out  (Aperture  Stops).  The  first  attempt  at  imaging  the  system 
involved  expanding  the  laser  to  fill  the  entire  aperture  of  the  AO  cell,  but  the  intensity 
of  the  light  incident  on  the  photodiode  array  was  too  low  to  provide  an  adequate  signal. 
Then,  a  cylindrical  lens  was  used  to  focus  a  vertical  line  through  the  cell’s  aperture  so 
each  channel  had  a  small  intense  beam.  Since  we  only  had  a  32  element  photodiode 


array,  there  would  be  at  most  32  bits  which  would  not  fill  the  entire  aperture.  This 
focused  beam  would  yield  more  optical  power  incident  on  each  pixel  of  the  photodiode 
array.  Focusing  a  line  on  the  cell  and  using  a  telescope  to  image  this  line  onto  the  second 
cell,  however,  decreased  the  resolution  between  the  zero  and  first  diffracted  order 
(smeared  them  together)  to  the  point  where  this  setup  was  not  useable. 

Instead,  the  laser  was  split  into  two  beams  illuminating  the  up^xtr  and  lower 
channels  of  the  AO  cells.  This  would  not  illuminate  all  channels  of  the  AO  cells.  A 
fully  illuminated  system  would  require  multiple  lasers  illuminating  different  channels  of 
the  AO  cells.  It  was  felt  this  would  sufficiently  test  the  capabilities  of  the  optics  to 
adequately  focus  the  light  onto  the  detector  array  as  more  distortion  occurs  the  fanher  an 
image  is  from  the  optical  axis  (which  passes  through  the  center  of  the  AO  cells). 
Therefore,  channel  16  on  the  first  AO  cell  was  lined  up  with  channel  15  on  the  second 
cell  since  channel  16  on  the  .second  cell  was  not  functioning  within  specifications. 
Similarly,  channel  2  on  the  first  cell  (channel  1  was  below  specs)  was  lined  up  with 
channel  1  on  the  .second  cell.  In  order  to  provide  light  to  two  channels,  the  la.ser  beam 
was  collimated  and  split  by  a  beamsplitting  cube.  The  beam  that  was  split  reentered  the 
optical  train  at  a  lower  level  via  min'or  M2. 


4  RF  Drivers  Characterization 

The  RF  drivers  were  manufactured  by  Newport  Electro-Optics  Systems.  Each 
driver  is  capable  of  delivering  300  mW  at  a  center  frequency  of  80  MHz.  The  output  is 
switchable  b<.‘twcen  Continuous  Wave,  CW,  and  externally  modulated.  The  input  to  the 
external  modulation  is  TTL  compatible. 

The  CW  output  power  of  each  driver  was  measured  using  im  RF  spectrum 
analyzer  and  a  HP  8496B  attenuator  (to  keep  the  analyzer’s  front  end  from  being 
overloaded).  Almost  all  outputs  had  to  be  adjusted  to  bring  them  within  specifications. 
The  output  power  of  each  channel  also  drifted  after  a  day,  but  not  enough  to  affect  the 
experiment.  To  be  safe,  an  RF'  power  meter  was  u.scd  to  monitor  the  output  power  levels. 
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The  extinction  ratio  was  obtained  by  calculating  the  ratio  of  the  output  power 
when  the  driver  was  switched  from  CW  to  off.  The  off  state  was  obtained  by  switching 
the  driver  to  external  modulation  with  a  logic  0  input  to  the  external  modulation  port. 
The  average  value  found  was  65  dB  which  was  more  than  adequate  for  this  experiment. 

The  spectral  purity  of  the  drivers  was  checked  by  setting  the  driver  to  CW,  and 
applying  the  output  to  an  RF  attenuator  connected  to  an  RF  spectrum  analyzer  and 
measuring  the  amplitude  of  the  first  harmonic  frequency  relative  to  the  center  frequency. 
Poor  spectral  purity  will  result  in  added  harmonics  distorting  the  sinusoid  (see  Figure 
4-1).  The  average  value  was  -17.5  dBc  (dB  below  the  center  frequency). 

The  risetime  of  the  RF  pulses  was  measured  by  modulating  the  driver,  and 
applying  the  output  to  an 
RF  attenuator  connected  to 
a  digital  scope.  The 
inverted  output  of  the  pulse 
generator  was  used  to 
trigger  the  scope  so  the 
edges  of  the  modulation 
envelope  would  be 
observed  (otherwise,  the 
scope  would  trigger  on  the 
RF).  The  display  of  the 
scope  w'as  set  to  envelope 
mode  which  displays  the  minimum  and  maximum  points  of  a  Wciveform.  After  a  couple 
of  seconds,  the  modulation  envelope  of  the  RF  pulse  would  appear  because  the  unsynched 
80  MHz  frequency  contained  in  the  envelope  would  trace  it  out.  The  pulse  generator’s 
output  is  designed  to  drive  a  50  ohm  load,  but  because  of  the  TTL  load  of  the  RF  driver 
it  is  connected  so  there  would  be  an  additive  reflection  at  the  input  of  the  RF  driver. 
Therefore,  the  amplitude  of  the  pulses  was  .set  to  approximately  2.2V  which,  when 
reflected,  became  approximately  4.4V  at  the  input  of  the  driver.  Later,  when  the  BNC 
tee  connectors  v.ere  used  to  drive  extra  channels,  the  puLses  were  degraded  because  the 


Figure  4-1:  RF  Waveforms  From  The  Drivers 


reflections  from  each  load  (the  drivers)  were  not  arriving  back,  at  the  pulse  generator  at 
the  same  time.  Instead,  a  50  ohm  load  was  connected  to  each  driver  input.  Although  this 
would  present  a  25  ohm  loa».  to  the  pulse  generator,  the  reflections  were  minimal,  and 
better  shaped  pulses  from  the  pulse  generator  were  obtained. 

Four  drivers  were 
selected  which  had  the  best 
spectral  purity  and  rise/fall 
time  characteristics.  The 
drivers  seiected  were 
channels  6  and  7  on  driver 
285,  and  channels  4  and  7 
on  driver  280.  Each 
channel’s  envelope  was 
stored  in  the  digital  scope’s 
memory,  and  the  envelopes 
of  the  selected  drivers  are 
shown  in  Figures  4-2  and 
4-3. 

5  Acoustic-Optic 
Cells  Characterization 


The  AO  ceils  lue  16 
channel  cells  manufactured 
by  Newport  EOS.  The 
material  is  Tellurium 
Oxide.  The  center 
frequency  for  the  cells  is 
80  MHz  and  the  optimum 


Figure  4-3:  Modulation  Envelopes  of  The  RF  Pulses 
From  The  Drivers 


wavelength  is  680  nni  I'he  Lithium  Niohate  transducers  have  a  length  and  width  of  Ih 
mm  and  0,^  mm  resiKvtively.  I'heir  spacing  is  2  mm  resulting  in  an  overall  height  ot 
45  mm  for  the  cell.  The  cell’s  width  is  12  mm. 

The  difiraction  efTiciency  or  each  chunuel  of  the  AO  cells  was  measured  using  the 
setup  in  Figure  5-1.  The  cell  is  rotated  to  the  Bragg  angle  with  respect  to  the  incident 
laser  which  produces  the  maxitnum  amount  of  diffraction  into  the  first  order,  With  no 
RF  applied  to  the  input  of  the  channel  being  measured  (no  diffraction),  the  resulting 
optical  power  was  measured.  Then,  RF  power  was  applied  and 
the  resulting  first  order  optical  power  was  measured.  The  efficiency  is: 

•  First  Order  Optical  Power  .rvAn/ 

Ejficiency= - i - a  100% 

Nondiffracted  Optical  Power 

The  average  diffraction  efficiencies  for  each  AO  cell  are  71%  for  AO  cell  0050  and  65% 
for  AO  cell  0051. 

The  propagation  velocity  of  the 
acoustic  wave  travelling  across  the  cell 
was  measured  by  using  the  setup  shown  in 
Figure  3-1.  Arbitrarily  long  pulses  from 
the  pulse  generator  were  used  to  modulate 
a  channel  of  one  AO  cell  while  the  driver 
to  the  other  cell  was  set  to  CW.  Three 
consecutive  detectors  were  observed  on 
the  digital  scope.  The  difference  in  delay 
from  adjacent  photodiodes  corresponds  to 
the  time  it  took  for  the  acoustic  wave  to  propagate  from  one  photodiode  to  the  other  (see 
Figure  5-2).  Since  the  photodiode  width  is  known  (150  pm),  the  propagation  velocity  is 
this  width  divided  by  the  delay  time  (34.5  ns).  This  resulted  in  a  propagation  velocity 
of  4.3  km/s  which  compares  favorably  with  the  specified  4.26  km/s. 


LASER 


AO  CELL 


OPTICAL 

POWER 
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Figure  5-1:  Setup  For  Measuring 

Diffraction  Efficiency 
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l  ijiurc  5-2:  Acoustic  Wave  Delay 


2  ns.  Tlie  array  is  lensed  which  results  in  minimal  dead  space  between  pixels’,  Its 
operaiing  voltap,e  is  ,^(K)  V. 

The  rise/l'all  titnes  of  the  detectors  were  tneaaured  by  applying  optical  pulses  with 
rise/fall  times  of  approximately  1  ns  from  the  York  laser  ditxle  to  the  detector  array.  The 
output  of  the  anay  was  applied  to  the  50  ohm  input  of  the  digital  scope.  Tlie  average 
rise/fall  times  measurcd  were  2  ns. 

The  unifonnity  of  the  responsivity  of  the  photodiode  array  was  checked  by 
applying  a  uniform  intensity  light  source  across  the  entire  array,  and  measuring  the 
resultant  output  current.  To  eliminate  the 
Guassian  intensity  profile,  a  HeNe  laser 
was  expanded  from  a  diameter  of  0.68 
mm  to  approximately  4,4  cm  and 
collimated  using  a  65X  microscope 
objective  and  a  10pm  pinhole.  The 
photodiode  array  was  reverse-biased  with 
2(X)V  and  all  elements  not  being  measured 
were  terminated  with  50  ohm  loads.  To 
test  a  panicular  detector,  the  50  ohm  load 


Figure  6-1:  Transiinpedanee  Amplifier 
Circuit 


was  a'lnovcd  t'nmi  thai  element,  a  transimpedance  amplifier  (see  Figure  6-1)  with  l(X)k 
iransinnvdancc  was  inserted  in  its  place  to  convert  the  pbouxiiode’s  output  current  to  a 
voltage,  and  the  amplifier’s  output  voltage  was  measured  with  a  Fluke  digital  multimeter. 
This  method  was  rejxtated  fttr  each  of  the  elements. 

The  output  signal  for  all  32  elements  was  measured  three  times  and  averaged, 
resulting  in  a  graph  of  the  output  voltage,  Vo,  versus  Element  #  (see  Figure  6-2).  Note 
a  sinusoidal  response  is  noticeable  across  the  photodiode  array.  The  light  source  was 
checked  for  uniformity  by  monitoring  the  output  of  a  single  photexiiode,  and  translating 
the  entire  array  horizontally  across  the  incident  la.ser  beam.  Fluctuations  in  the  amplitude 
of  the  photodiode  current  were  observed.  Also,  a  la,ser  beam  analyzer  showed 
interference  patterns  which  were  caused  by  reflections  from  the  lenses  in  the  optical 
system.  Element  #4  showed  a  greater  response  than  other  elements.  Further  study 
showed  that  this  element  produced  current  when  the  outside  edge  of  element  #1  was 
illuminated  with  a  fiber  light  source.  This  was  attributed  to  reflections  occurring  inside 
the  package  of  the  photodiode  array.  This  also  occurred  in  the  other  photodiode  array, 
and  was  more  pronounced.  The  affected  elements  were  not  needed  for  this  experiment. 


Figure  6-2:  Average  Responsivity  of  The  Photodiode  Array 
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System  Performance  Results 


To  evaluate  the  performance  potential  of  this  architecture,  the  data  input  '•ate  to 


the  system  would  be  increased  until  the 
system  experienced  crosstalk  from 
adjacent  photodiodes.  The  system  data 
input  used  were  two  repetitive  waveforms. 
The  first  waveform  was  a  .square  wave. 
Mote  the  output  is  analogous  to  an  analog 
convolution  of  the  two  inputs  to  the  AO 
cells  and  the  photodiode  array  bccau.se  of 
the  fact  the  two  inputs  propagate  towards 
each  other  and  across  the  photodiode 
array.  Referring  to  Figure  7-1,  the 
expected  output  when  one  AO  cell  is  CW 
and  the  other  has  a  .square  wave  is  a 
rc|x.*tiiivc  triangle  wave.  With  a  .square 
wave  on  both  AO  cells,  the  output  will  be 
a  triangle  pul.se  waveform  shown  in 
Figure  7-2.  If  one  of  the  inputs  does  not 
align  properly  in  time  with  the  other  input 
and  the  photodiode  array,  a  trapezoidal 
waveform  will  result.  Note  once  the 
proper  waveforms  are  obtained,  this  will 
show  only  vertical  alignment  of  the 
system.  The  alignment  in  the  horizontid 
direction,  perpendicular  to  the  optical  axis, 
may  be  off  an  integer  number  of  elements 
in  the  photodiode  array.  To  achieve  this 
alignment,  a  different  input  waveform  is 


Figure  7-1:  Expected  Output  Waveforms 
When  One  AO  Cell  is  Input  CW  And  The 
Other  a  Square  Wave 


Figure  7-2:  Expected  Output  Waveforms 
When  Both  Inputs  Are  Square  Waves 


required.  The  waveform  is  shown  in 
Figure  7-3  with  its  conesponding  expected 
output  waveform.  Once  the  previous 
alignment  is  obtained,  this  alignment  can 
be  obtained  by  either  translating  one  of 
the  AO  cells  or  delaying  one  of  the  inputs 
to  the  AO  cells  relative  to  the  other  input 
(using  the  delay  capability  of  the  pul.se 
generator).  Since  nothing  is  to  be  gained 
by  this  alignment  (the  crosstalk  shows  up 
in  the  previous  alignment),  it  was  not 
done  in  this  experiment. 

The  alignment  proceeded  in  two 
phases.  In  the  first  phase,  only  a  single 
beam  passed  through  the  center  of  the  optics,  keeping  distortion  (and  the  number  of 
variables)  to  a  minimum.  During  this  alignment,  other  diffraction  orders  were  reaching 
the  photodiode  array.  The  patterns  would  change  when  the  pulse  widths  of  the  data 
inputs  to  the  AO  cells  were  changed.  These  orders  were  due  to  a  second  order  grating 
being  established  by  the  repetitive  spacing  of  the  phase  grating  in  the  AO  cell.  Figure 
7-4a  shows  a  typical  channel  of  an  AO  cell.  The  vertical  dotted  lines  depict  the  acoustic 
wave  propagating  across  the  crystal.  When  the  spacing  between  the  input  RF  pulses  to 
the  AO  cell  become  small  enough,  light  begins  to  diffract  (see  Figure  7-4b).  This  was 
verified  by  observing  the  diffracted  output  beam  with  a  laser  beam  analyzer  while  varying 
the  pulse  widths  of  the  data.  The  analyzer  showed  a  diffraction  order  "spot"  move  as  the 
pulse  width  was  changed.  The  filter  slit  was  narrowed  to  remove  these  orders. 

Once  satisfactory  iveforms  were  obtained  (see  Figure  7-5),  two  beams 
illuminating  the  upper  and  lower  channels  of  the  AO  cells  were  used.  Note  the  triangle 
pulse  points  down  as  oppo.sed  to  the  waveform  shown  in  Figure  7-1.  This  is  because  of 
the  reverse  bias  of  the  photodiodes.  The  data  rate  was  high  enough  to  produce  crosstalk 
in  the  output  once  the  system  was  aligned.  Figure  7-6  shows  the  resulting  output 
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Figure  7-3:  Expected  Output  Waveforms 
When  Both  Inputs  To  The  AO  Cells  Are 
Repetitive  Pulses 


waveforms.  Note  the  center  of  the  zero  has  a  small  triangle  (a  "one")  indicating  the 
crosstalk.  There  was  not  enough  optical  power  incident  on  the  edges  of  the  photodiode 
array  to  generate  observable  waveforms.  Thus,  only  21  of  the  32  elements  of  the 
photodiode  array  were  able  to  be  observed.  The  input  data  rate  to  each  channel  of  the 
AO  cells  was  approximately  30  Mb/s. 
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Figure  7-5:  Output  Waveforms  With  One  AO  Cell  Input  CW,  The 
Other  Modulated  By  a  Square  Wave 
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Figure  7-6:  Output  Waveforms  When  Both  AO  Cells  Are  Modulated 
By  Square  Waves 


8  Conclusions 

The  data  rate  used  in  this  experiment  required  bit  widths  on  the  order  of  35  ns. 
Recall  the  RF  pulse  waveforms  shown  in  Figures  4-2  and  4-3.  It  is  clear  we  were 
stressing  the  capabilities  of  the  RF  drivers.  The  trailing  and  leading  edges  of  the  pulses 
are  not  only  poor,  they  are  very  different  from  each  other.  Furthermore,  the  modulation 
envelopes  of  the  four  RF  pulses  are  not  identical.  This  accounts  for  the  alignment 
difficulties  experienced  when  two  channels  of  the  AO  cells  were  illuminated.  It  is  evident 
the  RF  drivers  play  an  important  role  in  the  success  of  this  type  of  architecture. 

The  instructions  for  this  processor  are  low  level  Boolean  instructions  which 
determine  what  logical  OR  or  logical  AND  operations  are  to  be  perfonned.  They  are  not 
the  high  level  add  or  multiply  instructions  which  are  used  in  electronic  processors. 
Therefore,  it  would  not  be  appropriate  to  refer  to  the  computational  capability  of  this 
processor  as  MIPS.  Since  21  elements  were  aligned,  this  results  in  an  overall 
computational  capability  of  630  Million  Logical  Instructions  Per  Second  (MLIPS). 
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To  perform  an  n  bit  by  n  bit  addition  using  Full  Adders  (F/As)  with  ripple  carry 
requires  a  Half  Adder  (H/A)  plus  n-1  F/As^.  A  H/A  requires  one  EXOR  gate  and  an 
AND  gate.  Since  this  processor  cannot  do  an  EXOR  operation,  the  EXOR  gate  can  be 
expanded  to  its  equivalent  form  using  AND  and  OR  gates,  increasing  the  gate  count  to 
4.  Similarly,  it  takes  9  logic  gates  to  perfonn  the  function  of  a  F/A.  Therefore,  it  takes 
4  4-  9(n-l)  logic  gates  to  pei  orm  the  addition.  Thus,  for  a  16  bit  addition,  139  logic 
operations  are  required.  With  an  overall  computational  capability  of  630  MLIPS,  the 
processor  could  do  4.5  million  additions  per  second  (630  MLIPS/139  gates).  The 
overhead  for  multiplication  is  worse.  For  an  n  bit  by  n  bit  multiplication  done  by  forming 
jiartial  products  and  using  F/As  to  sum  these  products  to  produce  the  answer  requires  n^ 
AND  gates  to  generate  the  piulial  products  and  n(n-l)  F/As^  This  results  in  a  total  gate 
count  of  2,416  logic  gates  for  a  16  bit  multiplications  which  yields  260  thousand 
multiplies  |rer  .second  which  is  hardly  imprc.ssive.  Note  these  numbers  arc  assuming 
1(X)^}('  efficiency  in  the  utilization  of  the  processor  (it  is  filled  at  all  times). 

A  potential  bottleneck  to  this  processor  is  the  I/O  to/froni  it.  To  make  this  system 
workable,  amplifiers  and  thresholding  logic  and  buffers  would  be  required  at  the  output. 
The  detector  has  a  -3  dB  frequency  cutoff  of  175  MHz  (0.35/risetime).  This  equates  to 
a  maximum  data  rate  of  350  Mb/s.  At  data  rates  greater  than  this,  the  necessary  buffci’ 
electronics  and  photodiode  amplifiers/thresholding  logic  becomes  expensive  and  divncult. 
At  this  data  rate,  to  perform  100  million  16  bit  multiplications  per  .second  would  require 
AO  cells  with  64  channels  (2  channels  per  bit  for  the  dual  rail  and  placing  both  the 
multiplier  and  the  multiplicand  on  the  AO  cell).  The  processor  would  be  required  to 
process  at  a  rate  of  2.416  X  10'  MLIPS  (2,416  operations  per  multiply  times  100  million 
multiplies  jx;r  second).  To  achieve  this,  a  photodiode  airay  with  690  elements  is  required 
(2.416  X  lO'  MLIPS  divided  by  350  Mb/s  per  photodiode).  This,  of  course,  would 
require  buffering  690  data  lines  at  a  clock  rate  of  350  MHz  which  is  impractical.  To 
lower  the  number  of  data  lines  from  the  output  to  a  more  manageable  number  of  64 
would  require  the  increa.se  of  the  data  rates  by  over  an  order  of  magnitude. 
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operational  capability,  readiness,  and  supportability; 

c.  Provides  a  full  range  of  technical  support  to  Air  Force  Materiel 
Command  product  centers  and  other  Air  Force  organizations; 

d.  Promotes  transfer  of  technology  to  the  private  sector; 

e.  Maintains  leading  edge  technological  expertise  in  the  areas  of 
surveillance,  communications,  command  and  control,  intelligence,  reliability 
science,  electro-magnetic  technology,  photonics,  signal  processing,  and 
computational  science. 


The  thrust  areas  of  technical  competence  include:  Surveillance, 
Communications,  Command  and  Control,  Intelligence,  Signal  Processing, 
Computer  Science  and  Technology,  Electromagnetic  Technology, 
Photonics  and  Reliability  Sciences. 


